Dial: Distributed Interactive Analysis of Large Datasets
نویسنده
چکیده
DIAL will enable users to analyze very large, event-based datasets using an application that is natural to the data format. Both the dataset and the processing may be distributed over a farm, a site (collection of farms) or a grid (collection of sites). Here we describe the goals of the project, the current design and implementation, and plans for future development. DIAL is being developed within PPDG to understand the requirements that interactive analysis places on the grid and within ATLAS to enable distributed interactive analysis of event data.
منابع مشابه
Interacting with Large Distributed Datasets Using Sketch
We present Sketch, a library and a distributed runtime for building interactive tools for exploring large datasets, distributed across multiple machines. We have built several sophisticated applications using this framework; in this paper we describe a billion-row spreadsheet, and a distributed-systems performance analyzer. Sketch applications allow interactive and responsive exploration of com...
متن کاملDIAL- Ein BigBlueButton-basiertes System für interaktive Live-Übertragungen von Vorlesungen
Dieser Beitrag präsentiert DIAL (Distributed InterActive Lecture), ein BigBlueButtonbasiertes System für interaktive Live-Übertragungen von Vorlesungen. DIAL erweitert das Konferenzsystem BigBlueButton derart, dass Studierenden Zugang zum Chatund Umfragesystem aus dem Stand-By ihres Endgerätes heraus ermöglicht wird, d.h. ohne die Notwendigkeit einer permanenten Verbindung. Damit ermöglicht DIA...
متن کاملApache Drill: Interactive Ad-Hoc Analysis at Scale.
Apache Drill is a distributed system for interactive ad-hoc analysis of large-scale datasets. Designed to handle up to petabytes of data spread across thousands of servers, the goal of Drill is to respond to ad-hoc queries in a low-latency manner. In this article, we introduce Drill's architecture, discuss its extensibility points, and put it into the context of the emerging offerings in the in...
متن کاملAn Interactive Visualization Model for Large High-dimensional Datasets
Data visualization gives a direct view of complex data, which is especially helpful for analysis of large high dimensional datasets. However, existing methods often lose simplicity and clarity while rendering large amount of complex data. In this paper, we discuss some essential properties that a data visualization system should have. Also we present an interactive data visualization model whic...
متن کاملDetecting Distributed Scans Using High-Performance Query-Driven Visualization
Modern forensic analytics applications, like network traffic analysis, perform high-performance hypothesis testing, knowledge discovery and data mining on very large datasets. One essential strategy to reduce the time required for these operations is to select only the most relevant data records for a given computation. In this paper, we present a set of parallel algorithms that demonstrate how...
متن کامل